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PREFACE 


T his book is an attempt to provide, 
in a manner readily comprehensible 
to all those interested in the subject, 
an answer to some of the more important ques- 
tions that have been asked as to the nature, 
validity, and methods of application of mental 
tests, and the conclusions to be drawn there- 
from. It is not an attempt to theorize, but 
to illustrate general principles on a strictly 
empirical and experimental basis, and it is 
hoped that all who follow developments in 
education and in psychology may find some 
points of interest in it. 

The experiment described in the second part 
of Chapter V was carried out in the county of 
Northumberland. 

The note at the end of Chapter IV was pub- 
lished in The British Journal of Psychology 
(April 1922), and I am grateful to the Editor 
for permission to include it in this book. 
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Most of the chapters of the hook were kindly 
read by Mr Cyril Burt, to whom I am much 
indebted for valuable comments, 

C. A. R. 

Newcastle-om-Tyne 
Apnl 1922 
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Methods and Experiments 
in Mental Tests 

CHAPTER I 

INTRODUCTORY 

M ental tests are now becoming so 
familiar in the Enghsh educational 
world that it is unnecessary, in 
introducing a book such as this, to enter into 
a lengthy description of their nature. Pre- 
liminary experimental work has now, however, 
reached a stage where we may well pause to 
look round and to take stock of the results so 
far reached. 

In the first place, it is necessary to consider 
briefly the criticisms which have been levelled 
at the tests. These criticisms fall into three 
categories according as they are directed at 
(i) supposed faults inherent in the tests them- 
selves ; (2) difficulties in carrying out the 

alleged aim of the tests ; or (3) difiiculty in 
defining what this aim really is. 
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The first kind of critic will glance through a 
test booklet, seize on some particular test, and 
say confidently, “ This is far too difficult,” or 
“ This is far too easy,” or “ The mere sight of 
this will frighten the children (especially the 
nervous ones) out of their wits,” or ” How can 
you expect children to understand long words 
like this ? ” 

The answer to such a critic is simple and con- 
clusive He must be made to understand that 
the value of a test is not estimated by its appear- 
ance, but by what it is actually found to effect 
in practice. That is, he must be asked to look, 
not at the tests, but at their results. The pro- 
cedure of mental testing is rigidly empirical. 
All tests, before any conclusions are drawn from 
them, are tried on a number of children suffi- 
cient to determine whether they are suitable or 
not. In the case of a test devised in this way 
criticisms of the first type are therefore met in 
advance. 

Secondly, objections are sometimes urged 
against the possibility of fulfilling one primary 
aim of the tests, namely, to give a fair field and 
10 
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no favour. This objection usually takes the 
form of an assertion that the tests can easily 
be ‘ coached ’ for. It is true that in the present 
individual scales (such as the Binet scale) a 
few (not many) of the tests are markedly sus- 
ceptible to the influence of coaching, but even 
here all who have had experience in giving such 
tests will know that the presence of coaching 
can be detected with but httle difficulty. At 
most it is unlikely to add more than a few points 
to the child’s score. Moreover, in group testing, 
which is the most practicable way of carr5dng 
out tests on a sufficiently large scale, the possible 
variety, not only of the actual items of the differ- 
ent tests, but also of the particular forms which 
those tests may take (this being in effect limited 
only by the inventiveness of the human mind), 
is so great that coaching is rendered a wg.ste of 
time and labour. Even in a standardized group 
test which is placed upon the market the number 
and variety of items is sufficiently large tP 
nullify the efiects of coaching. In fact, the only 
point which can be sustained in this criticism 
is that, as children become familiar with mental 
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tests, the norms of performance may slightly 
improve owing to greater speed of getting to 
work and to absence of emotional disturbance 
due to confrontation with an unfamiliar kind of 
task. But £&iy such change in the norms wiU, 
of course, be automatically accounted for in 
the interpretation of results. 

Thirdly, it is frequently urged that although 
these tests are said to be tests of intelligence 
we are pnable to define what intelligence is. 
Now, in the first place, it cannot be too strongly 
emphasized that our ability to define intelli- 
gence IS a matter quite irrelevant to the ques- 
tion of the value of the tests. What is wanted is 
a method of separating the sheep from the goats 
— ^that is, the bright children, those who are 
capable of responding to our efforts to educate 
them,,, from the dull children, those whose 
capacity for response to educative influences is 
distinctly limited. If the tests do in fact 
enable us to grade children in order of their 
educability it does not matter in the least 
what we call them ; though, since we com- 
monly speak of bright children as ‘intelli- 
12 
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gent,’ ‘ intelligence tests ’ is probably as good 
a name as any. 

But apart from this we can frame a fairly 
precise definition of intelligence without much 
difficulty. In fact, it might even b5 regarded as 
synonymous with ' educability,’ the two terms 
being taken to imply the ability to acqmre 
knowledge and to use it. Intelligence is thus the 
capacity to organize in a coherent manner the 
confused mass of experience which pours in 
upon us through the channels of the senses. 
Or we may even carry our analysis a step farther, 
and search for a general factor which enters into 
all the processes which are ordinarily described 
as ‘ intelligent.’ To the writer it seems that 
attention constitutes such a factor, a belief 
which Binet himself appears to have held. The 
elementary intelligent processes — discriQiina- 
tion, comparison, analysis, s 3 mthesis — are all 
functions of attention. Intelligence would 
then be defined as " the functional efficiency oi 
attention.” 

But, it may be urged, if there is such a general 
factor, why is not everybody equally good, or 
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equally bad, at everything ? Why should not 
the great mathematician, for example, be an 
equally great historian, or zoologist, or literary 
critic ? The writer believes the answer to be 
that those sjiecial aptitudes with which we are 
so familiar in ordinary life are in each case the 
consequence of a combination of general intelli- 
gence and special interest, the latter determining 
the particular channels into which the former is 
directed.. For it is certainly a fact that good 
children, when young, are generally good all 
round, special tendencies becoming markedly 
apparent only during adolescence, when all 
kinds of interests and impulses hitherto latent 
begin to come into action. Moreover, a study of 
recent work in psycho-analysis will reveal to 
what a great extent the direction in which intelli- 
gent Activity turns is determined by causes of 
which the individual concerned is quite uncon- 
scious. 

There is, however, another experimental result 
at wliich the critics of mental tests frequently 
express incredulous surprise— namely, the fact 
that intelligence appears to cease developing 
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at something over seventeen years of age. 
But IS this so incredible ? Surely not when 
we remember the nature of intelligence. Intelli- 
gence is not knowledge, but the ability to organ- 
ize and employ knowledge. KnoVledge may 
grow indefinitely, but the growth of the ability 
to deal with knowledge may, like physical 
growth, cease comparatively early in life. 
The experienced, worldly-wise man of forty 
differs from the callow youth of sijcteen or 
eighteen not in ability to handle experience, 
but in having a far greater amount of experience 
to handle, and thus to apply in meeting new 
situations 

One final word as to criticism : the critics 
should remember that the results reached by 
those engaged in work on mental tests are the 
product of careful and extensive experimental 
work carried out in accordance with the usual 
rules of scientific procedure. Hence no criticism 
can be effective unless it is based on experiments 
equally careful and extensive. 

We have had occasion to draw a definite 
distinction between intelligence and knowledge 
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or attainment. The latter is a joint product 
of the former and environmental influences (of 
which, of course, teaching is one of the most 
important). But experience has shown that 
intelligence Itself is native to the child — a gift, 
good or bad, at birth, which no means known 
to us can appreciably improve. The bright 
child remains bright, the dull child dull, to the 
end of the chapter. We cannot increase intelli- 
gence, but we can enable the child to make 
the lest use of that amount of intelligence which 
he may happen to possess, and this is the true 
aim of all education. 

Intelligence tests, then, are in no sense tests 
of teaching. They differ from the ordinary 
examination chiefly in three important ways : 

(1) they are directed to the estimation of the 
childJ-s natural gifts, independently of such 
influences as home and social environment, 
and the effects of good or bad teaching ; 

(2) they aim at enablmg us to form a sufi&ciently 
accurate estimate of a child’s educabihty before 
that child has, in fact, been educated to any 
great extent ; (3) their results are expressed in 
16 
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terms of a purely objective standard and are 
unaffected by any such subjective factor as the 
personal equation of a particular examiner. 

With these aims in view the tests are so 
devised as to presuppose as little as possible 
beyond the fact that the child tested has been 
brought up (at whatever social level) under the 
ordinary conditions of a civilized community, 
and they are, of course, graded so as to be 
applicable to children of all ages and of all 
degrees of brightness. Their abihty to predict 
(whereas the ordinary examination is in general 
really effective only after the event) should be, 
if proven, of the greatest value in enabling us 
to determine for each child the method of 
education fitted to make the best of his particular 
level of intelligence. 

There are various methods of expressing 
degree of intelligence. One of the best known 
is by means of ‘ mental age ’ and ‘ intelligence 
quotient ’ (I.Q.). The mental age of a child 
is the age of the average child to whom he is 
equal in intelligence, and the I.Q. is the per- 
centage ratio of the mental age to the actual 
2 17 
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ige Thus a child of ten years may have a 
nental age of twelve years, his I.Q. then being 
120. Experiment has shown that the I.Q. 
of a given child remains nearly constant (which 
makes it a Very useful measure), so that the 
child m our example would, at the age of five 
years, have had a mental age of six years. 

A second way of expressing level of intelligence 
is by means of ‘ percentile rank.’ If a number 
of children be ordered according to degree of 
intelligence the percentile rank (P.R.) of any 
one of them is that percentage of the whole 
group which he just exceeds in intelligence. 
Thus if 70 per cent, of the children in the group 
are below the particular one selected in intelli- 
gence his P.R. will be 70. In view of what 
has been said of the nature of intelligence it 
is evident that the P.R. remains constant, for 
if a given child exceeds a certain percentage of 
all children in intelligence he will always exceed 
that percentage. 

A third method, which is often very useful and 
significant, is to express degree of intelligence 
in terms of ‘ standard deviation.* If a group 
18 
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of children of about the same age be tested, 
and the average or mean of their scores found, 
the deviation of each child's score from this 
mean is obtained and squared ; the squares are 
then added and their mean foun^ by dividing 
by the number of children. The square root 
of this mean is the standard deviation (S.D.). 
It is a measure of the ‘ scatter ’ of the intelligence 
of the group, the intelligence of a given child being 
expressed by sa5ung that his deviation, from the 
mean is so many times the standard deviation. 

Other useful quantities in connexion with the 
scores of a group of children are the ' median ’ 
— that is, the middle score of the group (namely 
that made by the child whose P.R. is 50) — ^the 
‘ lower quartile ’ (the score made by the child 
of P.R. 25), and the ‘ upper quartile ’ (the 
score made by the child of P.R. 75). Half the 
difference between the two quartzes is termed 
the ‘ semi-interquartile range ' (S.I.R.), and 
provides another measure of scatter and one 
easier to obtain than the S.D. 

Experiments in intelhgence tests turn largely 
on the comparison of the order in which children 
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are ranged by such tests with the order in 
which they are ranged by other kinds of tests. 
The results of this conaparison are expressed 
by means of what is termed a ‘ correlation 
coefficient.’ "it would be out of place here to 
enter into a description of the mathematical 
methods by which the degree of correlation is 
determined.^ It may be stated, however, that 
any value of the correlation coefficient numeric- 
ally greater than zero is presumptive evidence 
of the existence of a real connexion between 
the two sets of quantities compared ; but for 
small values of the correlation coefficient the 
‘ probable error ' (P.E.) is so large as to make 
the result practically insignificant. As the 
magnitude of the correlation increases, however, 
the probability of the existence of a real con- 
nexion becomes rapidly greater, and when the 
value is about .5 the probability of some 
connexion begins to approach reasonable cer- 
tainty. For values of the order .9 and upward 
the existence of a very close connexion is 

' For such a description see, e g , Brown and Thomson, The 
EssfinftiUs of Menial Measurement, Part II. 
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practically certain. Perfect correlation is repre- 
sented by the value i.oo. 

In the descriptions of the experiments which 
follow the writer has been concerned primarily 
to illustrate the general principles* involved in 
the examination of the validity of intelligence 
tests, and (if that validity is established) in the 
application of the tests for educational purposes. 
He has accordingly refrained from explaining in 
detail the more technical mathematical and 
statistical processes employed in obtaining some 
of the results.! 

In actually carrying out the experiments the 
writer had two chief aims in view. In the first 
place, he wished to make the investigation not 
merely extensive, but also intensive. In other 
words, he was not concerned simply to get mass 
results, but also to make a close study of 
individual children. For it is clear that we can 
never be satisfied with the tests until we are 
confident that they are not only reliable for 
groups of children considered as wholes, but 

^ Such explanations may be found elsewhere ; ^ ^ , in Brown and 
Thomson, op. c%t , Part II. 
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are also reasonably certain to do justice to 
particular children. We require them to be 
at least as certain in this respect as the older 
methods of selection. 

Secondly, •-the writer has relied as far as 
possible only on objective standards of com- 
parison. In judging intelligence tests a method 
frequently employed is to compare their results 
with teachers’ estimates. Now while no one 
would doubt the value of the estimate of a 
competent teacher, owing to the fallibility of 
human nature we can never feel quite the 
same certainty when a subjective factor, such 
as the personal equation of the teacher, enters 
into our calculations as we can when the 
standard of reference is purely objective. A 
man may be trained to estimate distances with 
som^ accuracy merely from observation, but 
we should never trust his estimate as we trust 
the foot-rule. In this connexion, therefore, 
it is necessary to state only that the writer, 
like others, has found close general agree- 
ment between teachers’ estimates and results 
of the tests, 
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The chief problems attacked in the experi- 
ments to be described were the reliabihty of 
the individual test and of the group test as 
indexes of educability, and the methods of 
interpreting scores in group tests i» such a way 
as to form an estimate of the mental capacities 
of the subjects tested — ^in particular, the methods 
to be adopted in the difficult case of adults and 
adolescents. 
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CHAPTER II 


THE RELIABILITY OF THE STANFORD-BINET 
SCALE OF INTELLIGENCE TESTS AS AN 
INDEX OF EDUCABLE CAPACITY 

I NTRODUCTION. The problem of deter- 
mining the use that can be made in 
educational work of intelligence tests in 
general, and of the Stanford-Binet scale in 
particular, reduces, as we have seen, essentially 
to this : Is it possible to foretell by means of 
these tests the limitations of a child’s capacity 
for being educated, and hence to lay down the 
appropriate hnes along which he should be 
taught ? 

The experiment herein described was an 
attempt to obtain decisive evidence of a quanti- 
tative nature on this point. Evidently the most 
direct method to adopt was to range a number 
of children in order of intelligence as determined 
by the S.B, scale, and then to arrange them in 
order of educable capacity by some entirely 
different (though sufficiently accurate) means 
24 



THE STANFORD-BINET SCALE 

of estimation. The two orders could then be 
compared, and the correlation between them 
calculated. 

For this purpose five batches of children 
(about twenty in each batch) were taken from 
five different schools. In order to ensure that 
the children had for some time had sufficiently 
favourable opportunities of learning, the schools 
selected were those known to be comparatively 
efficient, and the age range of the children was 
from ten to twelve, so that they had been at 
school for some years. In these circumstances 
the children’s educable capacity could evidently 
be estimated by discovering what they had 
actually shown themselves capable of learning. 
As indexes of this capability arithmetic and 
English composition were chosen as being the 
most fundamental, as well as the most repre- 
sentative, subjects of the curriculum. 

The children included members of both sexes, 
and they ranged in intelligence from exceptional 
superiority to definite mental defectiveness. 
In order to discount the effect of such hetero- 
geneities as difference of sex, different methods 
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of instruction, etc., each batch was considered 
separately, in addition to the consideration of 
the group as a whole. 

Every child was tested individually by the 
S.B. scale, and an order drawn up from the 
results. 

In estimating the significance of what follows 
it should be borne in mind that extensive ex- 
periments in America — ^in the course of which 
children were retested under var 5 dng conditions, 
by different examiners, and at varying intervals 
— ^have shown that the intelligence quotient of 
a given child remains practically constant. 
Indeed, the average variation of I.Q., even 
over intervals of several years, is no more than 
about 5 per cent.^ Hence, had the children in 
the experiment to be described been tested when 
entering, say, the senior school— i.e., at about 
seven years of age — instead of at about eleven 
years of age, the order obtained would have 
been practically the same. 

After being tested by the S.B. scale the 

^ Cf , e g , Terman, The Intelhgence of School Ch%ldren (Harrap), 
Chapter IX, especially pp* 138 £E 
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children were set a paper in applied arithmetic. 
For this purpose the paper given on pp. 193 ff. 
of Dr Ballard’s Mental Tests was used. On 
the child’s performance in this test it was 
possible to assign to him an ‘ arithmetical ’ 
mental age, and hence to find his ‘ arithmetic 
quotient ’ (A Q.) by a process similar to the 
calculation of the I.Q. 

At this point it should be noted that the 
norms of performance taken differed somewhat 
from those given on p. 195 of Mental Tests. 
It became apparent early in the experiment 
that these were too low, and in fact Dr Ballard 
mentions the possibility of this on p. 195 of his 
book. The norms he gives are in arithmetical 
progression, and it therefore seemed probable 
that any increase due to a return of more 
favourable conditions of education would be 
marked by a multiplication of his norms by a 
constant factor {i.e., the same for each norm 
at a given time). The first batch of results 
indicated an increase of about 25 per cent, in 
the norms, and this was strikingly home out 
as further results came in. 
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In composition the children were required to 
write on one of the following subjects : 

(1) How I spent my Easter Holiday. 

(2) Describe what you have seen of the 
River X and' its banks. 

(3) Write a story called Look before you Leap. 

(4) Write about this piece of poetry ; 

Bright yellow, red, and orange. 

The leaves come down in hosts , 

The trees are Indian princes, 

But soon they’ll turn to ghosts. 

As was expected, about 75 per cent, of the 
children chose the first subject, but this did 
not matter, as little difficulty was found in 
difierentiating them. 

In order to obtain an accurate standard a 
panel of five experienced examiners was formed, 
and every composition was marked indepen- 
dently by each examiner. A special method of 
marking was arranged. The examiners did not 
know the age of any of the children, and they 
were asked to mark a composition by assigning 
the child an age on it ; that is, a judgment of 
this kind was formed ; " This composition is 
28 
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about equal to that of the average child of 
X years of age.” The mark x was then awarded. 
The mean of the five markings was found, and 
this was taken as the child’s mental age as 
regards attainment in English composition. In 
awarding marks the examiners were asked to 
give primary regard to power of thought and 
to ability to express and arrange ideas in a 
logical and coherent manner. It may be 
remarked that, in the majority^ of cases, 
there was fairly close agreement between the 
examiners. 

From the mental age thus obtained the 
composition quotient ’ (C.Q ) was calculated. 

An order was drawn up by taking the mean 
of the two mental ages (for arithmetic and 
composition respectively) for each child, and 
another by taking the mean of the A.Q. gflji 
C.Q. (giving what may be called the ‘ educability 
quotient ' — E.Q.). Evidently this last order 
is the significant one from the point of view of 
educability. 

Results of the Experiment. In working 
out the correlations for the separate batches 
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Pearson’s correction of Spearman's formula for 
ranks was employed (this was sufficiently accu- 
rate for correlations of the order obtained). 

Calculated from the quotients the results were 
as follows ; • 


Correlation between 
educability and gene- 
ral intelligence as 
measured by the 
scale 


Group Group Group Group Group 
ABODE 
(Boys) (Girls) (Boys) (Girls) (Boys) 

.93 -75 88 .92 .91 


Calculafed from the mental ages the corre- 
lations were ; 


Group A Group B Group C Group D Group E 

93 -77 -89 -93 -92 

It will be seen that these correlations are 
exceptionally high.^ Moreover, the deviations 
from perfect correlations are almost entirely 
dufijto not more than about 15 per cent, of the 
children. For example, in Group B, where the 


^ It must, of course, be remembered that the high magnitude of 
the correlations here and m the sequel is m part due to the wide 
range of ability of the children tested, although this does not in 
any way vitiate the result In the case of a group of children of 
approximately the same abihty, such as a fairly umform class, this 
homogeneity would alone tend to reduce the correlation, apart from 
any other factor, 
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correlations are lowest, the deviation was due 
very largely to three of the children, apart from 
whom the correlation coefficient would have 
been about .9. In these cases there were almost 
invariably obvious explanations ^of the dis- 
crepancy. Thus many of the cases were those 
of young, bright children who had not been 
given sufficient promotion, and therefore had 
had no opportunity of reaching a level of 
attainment appropriate to their mental age. 
Other causes of fall of E Q. below I.Q. were 
late entry, bad attendance, inferior home con- 
ditions, serious emotional disturbance, physical 
defects, and malnutrition. Cases of marked 
rise of E.Q. above I.Q. are rare (amounting in 
this experiment to only about 4 per cent, of 
the whole) and seem to be generally attributable 
to a combination of mediocre or inferior intelli- 
gence with some temperamental characteristic 
such as capacity for hard work, or with home, 
school, or social conditions distinctly above the 
average. In such cases, however, it is probable 
that the superiority of E.Q. to I.Q. is not 
maintained to the same extent as the children 
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grow older. At all events, it is clear that 
the few discrepancies noted, far from detract- 
ing from the reliability of the S.B. scale, 
rather went to confirm its value when properly 
used. 

At the other end of the scale converse res-ults 
add further confirmation. For of the children 
of markedly inferior intelligence every one was 
placed in the attainment tests in practically 
exactly the position assigned to him by the 
S.B, scale. Hence the verdict of the latter — 
namely, that these children, even in favourable 
circumstances, were incapable of learning much 
— ^was completely vmdicated. 

One particular point is of interest in the last 
connexion. The norm of performance in the 
arithmetic test, at eight years old is | of a 
mark. Not a single child whose mental age, 
as given by the S.B. scale, was below eight years 
scored a mark in the arithmetic paper. The 
significance of this becomes clear in the light 
of the facts that (i) some of these children 
were actually twelve years old; and (2) the 
first problem in the arithmetic paper was as 
32 
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follows : “If there are loo apples on a tree and 
the wind blows down 17, how many are left on 
the tree ? ” Some of the children diagnosed 
by the S.B. scale as mentally defective wrote 
pages of figures, but without scoring a single 
mark. 

The group of 100 children was then considered 
as a whole. The rank correlation between the 
results of the educational tests and those of the 
S.B. test was obtained for the whole group. 
Calculated from the mental ages the correlation 
was .90, and from the quotients .89. The 
absolute (as distinguished from the ‘ rank ') 
correlation between the I.Q.s and E.Q.s of the 
members of the group was also calculated by 
the Bravais-Pearson product-moment formula, 
and was found to be .89. The probable 
error in these three correlations was of the 
order .01. 

It will be seen from the table at the 
end of this chapter that the correspondence 
between I.Q. and E.Q. was, in the majority 
of cases, strikingly close. For the whole 
group the distributions of both I.Q. and E.Q. 
3 33 
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approximated to normal. The constants of 
these distributions were : 


LQ. 

Highest 149 
Lowest 61 
Mean 104 36 
Median 102 
Seim-interquartile 
range 13 

Standard deviation 
19.13 


E.Q. 

Highest 138 
Lowest 64 
Mean 102.33 
Median 102 
Semi-interquartile 
range 9 5 

Standard deviation 
14.56 


It will ’be seen that the scatter of the 
E.Q.s is markedly less than that of the I.Q.s, 
probably indicating insufficient elasticity of 
promotion so far as the brighter children are 
concerned. 

Lastly, it is important to remember that, for 
reasons already pointed out, the S.B. scrie 
would have arranged these children in practi- 
cally the same order had they been tested at 
(say) about seven years old instead of at about 
eleven. In other words, the order of intelligence 
given by the S.B. scale at about seven years old 
would correlate with the order of the children’s 
educational attainment at about eleven years old 
34 
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'to nearly the same degree as the correlations 
given above. Hence, by testing at seven years 
old it would be possible accurately to predict 
educational attainments after some years’ 
interval — that is, in effect, to fosm an exact 
estimate of educable capacity. 

Conclusions. Here, then, we have a num- 
ber of children of both sexes, educated in 
different schools, and of all levels of intelligence, 
and it turns out that their general intelligence, 
as assigned by the S.B. scale, correlates to a 
high order with their educable capacity as 
estimated from what they have actually shown 
themselves capable of learning in favourable 
circumstances. In about 85 per cent, of the 
cases the correlation is practically perfect, while 
in the remaining 15 per cent, there are generally 
clear reasons for the discrepancy. 

It is true that, for many purposes, the number 
of children involved would not be sufficient to 
warrant the induction of a general principle ; 
but it should be pointed out that the circum-' 
stances were such as to render the quantitative 
evidence ahorded by this experiment practically 
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decisive in favour of the reliability of the S.B.‘ 
scale as an index of educable capacity, especially 
when this evidence is taken in conjunction with 
the mass of qualitative evidence accumulated 
in America comparison with teachers’ esti- 
mates, following up of school careers, etc. For 
there does not appear to be any reason why 
this group of children should differ markedly 
from other groups of children, educated under 
sufficiently favourable conditions, in such a way 
as to modify the results of the experiment 
materially. In other words, there seems no 
reason to doubt that this group may be taken 
as a typical example of groups of children who 
have been afforded satisfactory opportunities 
of learning ; and, if this be granted, it evidently 
follows that conclusions true of this group would 
also be true, in their main principles, of any 
similar group. 

There are also certain subsidiary conclusions 
of interest. Looking at the matter from the 
converse point of view, it follows that tests in 
arithmetic and English composition are good 
indexes of educable capacity, always provided ihcd 
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(i) the tests are properly standardized ; (2) the 
children to whom they are appHed have been 
educated in favourable circumstances ; and 
(3) due allowance is made for the ages of the 
individual children. 

Attainment in arithmetic seems to correlate 
with general intelligence rather more closely 
than does attainment in composition, for whereas 
the respective correlations for the five groups of 
children between general intelligence and arith- 
metical attainment were .91, .78, .87, .92, .88 
(calculated from the quotients), or .92, .83, .90, 
.93, .86 (calculated from the mental ages), the 
correlations between general intelligence and 
attainment in composition were .93, .42, .72, 
.84, ,70 (calculated from the quotients), or 
.89, .32, .80, .82, .82 (calculated from the mental 
ages). Probably the lower correlation in the 
case of composition is partly real, partly appar- 
ent, Partly apparent, for, even with five 
independent markings, it is not possible to 
obtain an assessment of ability m composition 
of the same accuracy as the assessment of 
arithmetical ability. Partly real, for it seems 
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probable that attainment in composition depends 
on other factors (such as home and social 
environment) besides intelligence and education 
to a greater extent than does attainment m 
arithmetic. ■^The reason for the comparative 
breakdown of the correlation as regards com- 
position in Group B is not altogether clear. 

If the S.B. scale is accepted as a reliable index 
of educable capacity children might well be 
tested by it twice during their school life, namely 
at about seven years of age, and again at about 
eleven years of age.^ The first test would 
indicate the child’s educability with sufficient 
accuracy to determine the methods according to 
which he should be taught in the senior school. 
The second test would serve to check the I.Q. 
obtained from the result of the first test, and 
would also settle the question of the child’s 
capability of profiting by advanced instruction. 

1 For cliildren of eleven, however, it will probably be necessary 
to use group intelligence tests, owing to the immense saving of 
time thereby etEected. 
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Child’s 

No 

IQ 

EQ 

Child’s 

No 

IQ 

EQ 

Child’s 

No. 

IQ. 

EQ 

I 

67 

71 

37 

91 

104 

72 

96 

98 

2 

141 

138 

38 

II 2 

113 

73 

lOI 

100 

3 

lOI 

107 

39 

98 

107 

74 

96 

100 

4 

126 

124 

40 

95 

91 

75 

122 

II 2 

5 

73 

69 

41 

85 

88 

76 

no 

105 

6 

133 

125 

42 

III 

II4 

78 

95 

97 

7 

61 

64 

43 

142 

130 

7 § 

102 

107 

8 

II 2 

100 

44 

99 

89 

80 

lOI 

102 

9 

109 

105 

45 

92 

91 

81 

102 

97 

10 

119 

98 

46 

I16 

108 

82 

89 

88 

II 

125 

108 

47 

83 

81 

83 

89 

92 

12 

8x 

100 

48 

114 

106 

85 

I3I 

118 

13 

123 

113 

49 

90 

92 

86 

115 

no 

14 

78 

81 

50 

91 

84 

87 

71 

82 

15 

I 2 I 

II 2 

51 

80 

83 

88 

138 

130 

i6 

75 

67 

52 

88 

87 

89' 

128 

120 

17 

122 

120 

53 

96 

89 

90 

121 

112 

i8 

102 

97 

54 

92 

96 

91 

128 

120 

19 

117 

III 

55 

133 

107 

92 

129 

119 

20 

139 

123 

56 

112 

106 

93 

102 

107 

21 

108 

114 

57 

120 

no 

94 

102 


22 

107 

100 

59 

III 

121 

95 

105 

104 

23 

71 

75 

60 

130 

108 

96 

107 

108 

24 

113 

108 

61 

92 

105 

97 

107 

100 

27 

92 

lOI 

62 

86 

94 

98 

115 

no 

28 

144 

125 

63 

no 

102 

99 

108 

103 

29 

149 

117 

64 

113 

117 

100 

88 

95 

30 

108 

99 

65 

102 

107 

lOI 

96 

82 

31 

103 

lOI 

66 

67 

76 

102 

78 

98 

32 

93 

92 

67 

89 

95 

103 

91 

93 

33 

93 

90 

68 

128 

121 

104 

99 

98 

34 

113 

102 

69 

84 

94 

105 

94 

94 

35 

82 

102 

70 

78 

92 




36 

117 

114 

71 

134 

134 





The missing numbers are those of children who were 
absent from some of the tests. 
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CHAPTER III 


THE DERIVATION OF MENTAL AGES FROM 
SCORES IN A GROUP TEST 

I F a number of children have been examined 
by means of a group intelligence test we are 
able, by comparing their scores, to arrange 
them in order of relative intelligence. This is 
sufficient for the important purpose of classifying 
the chUdreii according to the degree of develop- 
ment which their intelligence has reached. But 
it is quite inadequate for other, and equally 
important, purposes. For we cannot compare 
the real degrees of intelligence of the children 
until we have taken their ages into account. 
For example, one child may score more than 
another in the test, but the first child may be 
fourteen years of age, while the second may 
be only ten. Thus, although the intelligence 
of the first has developed more than that of the 
second (so that he would be classified higher), 
the latter may in relative development be ahead 
of the former. That is, the ten-year-old may be 
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more developed than was the fourteen-year-old 
at ten years of age. Therefore, before we can 
be in a position to judge as to which child is 
really the more intelligent we must somehow 
manage to reduce them to a common denomi- 
nator by taking their ages into account. 

The process we have indicated is carried out 

(as we have seen in another connexion) by 

assigning to each child a mental age, dividing 

this by the actual age, and multiplying by loo 

# 

to obtain the I.Q. The question then arises as 
to how mental age is to be obtained from score 
in the test. The mental age is the age of the 
average ’ child to whom the given child is equal 
in intelligence. It will therefore be necessary 
to determine the score of average children in 
the test, or, to put it another way, to calculate 
the average scores in the test of children ip 
various age groups. Thus we may take all the 
ten-year-olds tested, and find their average 
score ; similarly with the eleven-year-olds, and 
so forth. But if our results are to be valid the 
age groups must be sufficiently large and sufii- 
ciently random to constitute fair samples. 
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Having obtained these average scores or 
age norms ’ we proceed to find the mental age 
of any given child as in the following example : 
Suppose a child of ten scores 6o, and we find 
that 6o is the average score of the twelve-year- 
olds. Then the child’s mental age, as measured 
by the test, is twelve, and his I.Q. is 120. 

But is the mental age, as measured by the test, 
the true mental age, i.e., does it express with 
sufiicient accuracy the child’s standing relatively 
to the average child ? For the aspects of 
intelligence are so many and various that it is 
not easy to devise a test to measure them all, 
and it is possible that we may miss some 
point in which the child shows up particularly 
well. 

Now unfortunately it is possible to assign a 
miental age only by means of some test, and no 
test is perfectly comprehensive. What is needed, 
then, is a specially comprehensive test which, 
though not perfect, will serve as a standard b}!.. 
which the results of other tests may be judged. 
Probably the most accurate and comprehensive 
scale of tests at present available, at any rate 
42 
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for children up to about fourteen years of age, 
is the Stanford-Binet scale. It will therefore 
be important, after derivmg mental ages from 
the group test in the way indicated above, to 
check these mental ages by comparison with 
the corresponding Stanford-Binet mental ages 
in the case of children who have been tested 
both by the group test and by the Stanford-Binet 
scale. 

We may now proceed to illustrate the 
principles we have laid down by an example 
from actual practice. 

472 children, ranging from ten to thirteen 
years of age, were tested by Form A of the 
Terman Group Test of Mental Ability, for which 
no age norms of performance had at that time 
been published. After the examination the 
booklets were scored and the average scores or^ 
age norms for different age groups calculated. 
A ' curve of performance ’ was then obtained 
by plotting average scores for various ages 
against those ages.^ Curves of performance in 

1 This procedure will give valid results only if the distribution 
of scores in each age group is approximately normal — as it actually 
was m the experiment here descnbed. 
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I 

group tests of this kind usually approximate, 
over a considerable range, to straight lines, ^ 
and the curve obtained in this case was no 
exception. The age norms foimd were : 

Age in ye^-s loi io| iij iif 12J laf isj 
Average score 48 56 65 71 82 83 88 



Fig. I. 

3 . The true curve of performance is probably an ogive (c/. foot- 
note on p, 65)* The straight lines obtained as described m this 
chapter are simply mtended to be close approximations to a large 
portion of the ogive. The simple nature of the equation to the 
straight line renders it very convenient to use in practice. 
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The section AB is a straight line, the points 
corresponding to ages loj, loj, iij, 12 J years 
respectively lying practically exactly on this 
line, while the point corresponding to ii| years 
is very nearly on it. As to the eccentric section 
BC a word of explanation is necessary. After 
the age of about 12 years many of the brighter 
children have left the elementary schools for 
secondary and other schools. Hence the age 
groups above 12 become less and less representa- 
tive samples, and their performances' will drop 
below what might have been expected from a 
consideration of the performances of the younger 
children. We must therefore neglect the portion 
BC of the curve of performance, and replace it 
by the dotted line BK, which is an extension 
of the straight line AB. The higher mental 
ages should then be obtained, not from BC, 
but from BK, which represents the average scores 

Q 

of higher age groups on the assumption that 
intelligence continues to develop at the same 
rate as for the younger age groups.* 

If we call the mental age y and the score in 

* This point will be elucidated more fully m Chapter IV. 
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the group test x, the straight line ABK is 
represented by an equation of the form 

y = mx + c 

which is the typical equation for a straight line. 

For ABI^it can easily be shown that, if y be 
expressed in months, m = and c = 90, Hence 

y — ^ + 9® 

that is, the mental age can be derived from the 
score by the following formula ; 

Mental kge in months = of score) + 90 

Of course, once having obtained the straight 
Ime ABK, it would be possible to read off from 
it directly the mental age corresponding to any 
given score without working out the formula. 
Also a table might be drawn up from the line 
or from the formula giving the mental age for 
various scores. Had the curve of performance 
not been a straight line its equation would have 
been more complicated, and the determination 
of the formula might have presented some diffi- 
culty. In such a case it would be more convenient 
to construct a table direct from the curve. 

So far so good — ^but a fresh point now arose. 
46 



DERIVATION OF MENTAL AGES 

It was known that the batch of children tested 
was likely to be, on the whole, above the 
average of the total population. Hence this 
group of children set a standard above the 
average, so that the mental ages given by the 
above formula would be lower than the true 
mental ages {i.e., the mental ages obtained rela- 
tively to a fair sample of the whole population, 
for such a sample would set a lower standard 
than the particular group of children tested). 
Therefore the formula could be regarded only 
as a first approximation to the truth, good 
enough for arranging these particular children in 
their order of intelhgence, but needing to be 
checked and modified where necessary if it was 
to be used for obtaining true mental age. 

The formula was checked in the following 
way : One hundred children were tested indivi- 
dually by the Stanford-Binet scale, and were 
then given the Terman group test. A curve was 
plotted giving the relation between score in the 
group test and S.B. mental age. As was 
expected, this curve gave somewhat higher 
mental ages for given scores than did the line 
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ABK in Fig. i. The number of high scores 
was too few to form a basis for safe generalization, 
but in the higher ranges there was an apparent 
tendency for the new curve and the line ABK 
to converge toward one another. This looked 
as if the formula obtained from ABK was more 
accurate for high scores than for low. 

Another check became available later. In a 
more recent edition of the Manual of Directions 
for the Terman group test^ there appeared a 
table, based on 306 cases, giving tentatively the 
probable correspondence between scores and 
S.B. mental ages. The line plotted from this 
table was found to agree very closely indeed 
in the middle ranges with the curve obtained 
from the 100 cases mentioned above. But 
for the lower ranges the curve gave rather 
lower mental ages than the new line. It is not 
improbable, however, that the curve is more 
trustworthy here, for Terman mentions in a 
footnote that the mental ages given in his table 
are likely to be somewhat too high. 

^ C/. p. 10 of the Manual of Dtrections in the 1921 American 
©ditlOE, not yet obtainable in England. 
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The relation between the original line ABK, 
the curve obtained afterward by comparison 
with the S.B. scale, and, finally, the line plotted 



from the data given by Terman is represented 
in Fig. 2, which is drawn only approximately 
to scale. 

In Fig. 2 AA represents the performance of 
the 472 children in the first experiment (ABK 
in Fig. i), BB the curve afterward obtained by 
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the writer from the comparison of the scores 
of the 100 children with their S.B. mental ages, 
and CC the line plotted from Terraan’s data 
referred to above. 

The following points will be noted : 

(1) The convergence of AA, BB, and CC as 
the higher scores are approached. 

(2) The coincidence of BB and CC in the 
middle range. 

(3) AA and BB both give lower mental ages 
for the lower scores than CC. 

Considering AA, BB, and CC in conjunction, 
and allowing so far as possible the appropriate 
weight to each, we are led to a line in the position 
of the dotted line in the figure as probably a close 
approximation to the true representation of the 
relation between mental age and score in the 
Terman group test. 

When this line was plotted and its equation 
expressed in the form 

y — mx + c 

it was found that, compared with the original line 
AA, ni had diminished from -irs to about ^ or 
while c had increased from 90 to about no. 
50 
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Thus the new equation will be very nearly 
y = AX + no 

so that mental age may be obtained from score 
by the formula 

Mental age in months = (f oS score in 
Terman group test) + no 

Notice that for small scores the first item on 
the right will be unimportant compared with 
the second term — the constant no ; while as 
the score increases the first term becomes more 
and more important. Thus the lower mental 
ages assigned will depend very largely on the 
value given to c (which we have taken as no), 
while the higher mental ages will depend more 
or less equally on c and m (which we have taken 
asf). 

Now the value of m depends on the slope of 
the line, and it will be seen from Fig. 2 that AA 
and CC do not differ greatly in slope, while the 
slope of BB, though it varies somewhat from 
point to point, is on the whole much the same 
as that of AA and CC. For AA m — for 
CCm = about |*|. Therefore there is unlikely to 
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be a serious error in the value we have taken for 
m, namely The value of c, on the other hand, 
depends on the value of y for which x vanishes ; 
that is, it represents the mental age in months 
for which the score becomes zero — the age below 
which the average child cannot even begin 
to do the test. For AA c = go, a mental age 
of 7| years. For CC c = about ii6, a mental 
age of 9 years 8 months. There is a considerable 
discrepancy between these two values of c, and 
we should therefore find somewhat serious 
differences in the lower mental ages according as 
we adopted one or the other. 

But it is possible to get nearer to the true value 
of c. For it was pointed out that the formula 
y — Tt5 X + go gives mental ages which are too 
low, and we saw that the error increases as the 
score diminishes. On the other hand, while the 
mental ages found from CC are likely to be too 
high, the comparison here is at any rate one 
between score and true or S.B. mental age. 
The value ii6 for c is thus indicated as being 
considerably nearer the truth than the value go. 
Moreover, the curve BB points to a value for 
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c of about io6, or a mental age of 8 years lo 
months. It is therefore improbable that there 
is a serious error in the value we have provision- 
ally decided to take for c, namely no, or a 
mental age of 9 years 2 months. But it will 
be clear that the precise determination of this 
constant is important. 

While the formula y = ^ x 110 probably 
gives with some accuracy the correspondence 
in general between true mental age and score in 
the Terman group test, it must not be forgotten 
that errors may occur in individual cases. If 
the tests were perfect, children testing at the 
same mental age on the S.B. scale should score 
the same in the group test. But in practice 
it not infrequently happens that children of the 
same S.B. mental age make considerably different 
scores in the group test. The reason is not far 
to seek. In the group of 100 children who were 
given both tests all cases where equal or nearly 
equal mental ages were correlated with different 
group test scores were carefully analysed. In 
nearly all such cases it at once became evident 
that the discrepancies were due to the fact 
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that the two tests did not cover exactly the 
same ground. Certain aspects of intelligence 
to which some of the S.B. tests are directed 
are not touched by the Terman group scale. 
For example, that function of intelligence which 
determines control of visual imagery is not 
specifically tested by the latter, while in the S.B. 
scale there are at least two tests (No. 6 in 
year xiv, and No. 2 in ' superior adult ' group) 
which depend primarily on the manipulation 
of visual imagery. Thus, if a child made a 
lower score than might have been expected 
from his S.B. mental age, it was found on 
analysis that he had shown up well when 
tested by the S.B. scale in some direction, 
such as control of visual imagery, which is not 
specifically tested by the Terman group test. 

On the whole, however, the distribution of 
points obtained by plotting the S.B. mental 
ages against the group test scores shows that 
the S.B. mental ages of two children with the 
same score are much more likely to agree fairly 
closely than to differ widely. But, at the same 
time, it is clear that group tests should be 
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devised in such a way as to make them as 
comprehensive as possible. Otherwise in indi- 
vidual cases different scales may give consider- 
ably different results ; and we cannot feel 
complete confidence in the scientific validity 
of our scales as trustworthy instruments of 
measurement with reference to an objective 
standard until we have advanced to a point 
where different scales, when applied to the same 
child, will give results in close agreement.^ 

1 The writer may perhaps be permitted to refer here to the 
Simplex Group Intelligence Scale (Harrap), which he has recently 
published, and which is devised to form a scale sufficiently com- 
prehensive as regards both the various aspects of intelligence 
which it covers and the age range of the children to whom it may 
be suitably apphed. The Simplex scale was admmistered to about 
a hundred children whose mental ages had been previously estimated 
by several other methods (mcludmg the S B individual test) The 
correlation between these mental ages and the scores m the Simplex 
test was found to be .94 There were only three marked dis- 
crepancies, a proportion so small that it might well have been due 
to accidental causes. Neglecting these three cases, the correlation 
rose to .97, which is very high mdeed. 
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CHAPTER IV 

METHODS OF ESTIMATING THE TRUE INTEL- 
LIGENCE QUOTIENTS OF ADULTS AND 
ADOLESCENTS 

T he results of experiment seem to show 
that intelligence grows at an approxi- 
mately constant rate up to the age of 
something over fourteen years. We may term 
this age the ‘ critical age.' Beyond the critical 
age the growth of intelligence slows down until 
it comes to a complete stop at the age of some- 
thing over seventeen years. The age at which 
this cessation of growth occurs will be called 
the ' terminal age.’ 

As we have seen, a convenient way of express- 
ing the absolute level of intelligence of a given 
child at a given time is by means of ‘ mental 
age.' The rate of growth of intelligence will 
then be the ratio of the mental age to the actual 
age. This ratio, expressed as a percentage, is 
the ‘ intelligence quotient,' and is found to be 
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approximately constant for the same child from 
early childhood up to the critical age, though 
different for different children. The I.Q, 
(which really represents the rate of mental 



growth) is thus a highly convenient index 
of a child’s degree of intelligence. Fig. 3 
represents the growth of intelligence of three 
children, one inferior, one average, and one 
superior. 

It is by no means certain, though for conveni- 
ence it has been assumed in Fig. 3, that the 
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critical and the terminal ages are approximately 
the same for all children.^ 

The child’s degree of intelligence is, then, 
given by the I.Q., which represents the rate of 
growth during the period when that rate remains 
nearly constafit, namely below the critical age. 
But a difficulty arises when a subject is not 
tested until the adolescent or adult period, for, 
owing to the slowing down of mental growth, 
the ratio of mental age to actual age will change, 
and will therefore no longer be an expression 
of the true I.Q., but will enable us only to set 
certain limits to the subject’s intelHgence. 

The problem before us, therefore, is as follows ; 
How, from the score obtained in a mental test 
by an adult or an adolescent subject, can that 
subject’s true I.Q. be estimated — i.e., the I.Q. 
which would have been found for him had he 
been tested in childhood ? 

It will be simpler to develop the methods of 

^ Even sEould it be found ultimately that in the case of all children 
intelhgence ceases to grow at approximately the same age, different 
children will have reached at that age very different levels of mental 
development, E.g., the dull or defective child may have attained 
a mental age of only ten or twelve years, at which he remains for 
the rest of his life. 
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solution proposed by reference to a particular 
group test for which the age norms of perform- 
ance are known. For this purpose we shall take 
the Otis group test. All that wiU be said would 



be equally applicable, mutatis mutandis, to any 
other point-scale test. 

I. A Direct Method. Fig. 4 represents the 
curve of performance in the Otis group test, 
as plotted from the table of norms. ^ This curve 
is a rough reflection of the curve of growth of 
intelligence of the ‘ average ’ child. A given 
child’s mental age is found by taking his score in 

^ See Manual of Dtrecttons for Otis group test, p. 65 (Hanap). 
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the test and finding from the curve the age to 
which it corresponds. The percentage ratio of 
this age to the child’s actual age will give his I,Q. 
A glance at the curve in Fig. 4 will show that the 
critical age is between 14 and 15 years (actually 
14 years 5 months — ^see table of norms) and the 
terminal age about 18 years. Below the critical 
age the improvement in score of the average child 
is I point per month, or 12 points per year. 

Let us first consider the case of a fairly bright 
child below "the critical age.^ Suppose at 12 
years of age he scores 102. His mental age is 
then 13 years 2 months, and his I.Q. no. From 
12 to 14 years his intelligence will continue to 
grow at much the same rate, as he is below the 
critical age. At 14 years his mental age should 
therefore be 15 years 5 months. His rate of 
improvement of score will be 10 per cent, above 
1that of the average child {i.e., it will be i.i points 
per month), and if he is again tested at 14 years 
his score will be 102 + (24 x i.i) — ^that is, 

^ In what follows, the not improbable assumption is made that 
the mtelhgence of a given child contmues to bear to the intelligence 
{of the average child nearly the same relation after the critical age 
as before that age, though the rate of growth of mtelhgence of both 
children win then be dimmishmg. 
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about 129. From the curve in Fig. 4 the age 
corresponding to a score of 129 is about 17 years, 
and the ratio of this to 14 years will thus be 
greater than the true I.Q. The reason for this 
discrepancy is, of course, that during the period 
12 to 14 years the child’s mental growth has not 
been following a curve which has been rising less 
and less rapidly like the one in Fig. 4 for a part 
of the time, but has still been increasing at a 
comparatively steady rate. Thus, to obtain the 
true I.Q. we must extend the straight portion 
of the curve in Fig. 4. The age along this 
straight extension corresponding to a score of 
129 will be 15 years 5 months, and the ratio of 
this to 14 years gives no, the true I.Q. In other 
words, when the child’s mental age rises above 
the critical age, we must compare him with a 
fictitious average individual ^ whose intelligence 
(and therefore mental age) continues to grow at 
a steady rate indefinitely, if we are to measure 
the true I.Q. The mental age thus obtained we 
shall call the ‘ effective mental age.’ When the 
actual mental age is below the critical age it 

1 Cf. also Manual of DirecHons for Otis group test, pp 53 ft. 
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will evidently be identical with the effective 
mental age. Clearly this method of finding the 
effective mental age is applicable to a subject 
of any age. 

The above case points the way to the solution 
of the main* problem we are considering — ^the 
determination of the true I.Q. of the adolescent 
or adult. In such cases a difficulty arises addi- 
tional to that found in the case of the bright child 
just below the critical age. For what are we 
to take as the denominator in the I.Q. ratio for 
subjects over the critical age and eventually 
over the terminal age ? 

We may consider the case of the terminal age 
first. Let us now suppose that intelligence, 
instead of slowing down after the critical age and 
gradually coming to a final stop at the terminal 
age, continues to grow at the same constant rate 
as below the critical age, and then comes to a 
sudden stop at the same level as is reached in 
actual fife at the terminal age. Thus we must 
take the score of the average subject in the test 
as continuing to increase by i point per month. 
Now from the curve in Fig. 4 we see that, for the 
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average individual, the ' terminal score ’ — i.e., 
the score reached at the terminal age — ^is 130. 
Had the score still increased after the critical age 
of 14 years 5 months (at which the score is 117) 
at the rate of i point per month, the score of 130 
would have been reached at the age of 15 years 
6 months. We may caU this age the ‘ effective 
terminal age.’ Then for all subjects older than 
the terminal age we must take the effective ter- 
minal age as the denominator of the I.Q. ratio. 
The I.Q. for such subjects will therefore be the 
ratio of the effective mental age (calculated as pre- 
viously described) to the effective terminal age. 

It stiU remains to deal with the case of those 
subjects who are younger than the terminal age, 
but older than the critical age. The procedure 
here is to j&nd the age at which the score corre- 
sponding on the curve in Fig. 4 to the actual 
age of the subject concerned would have been 
reached if the score of the average individual 
continued to increase by i point per month after 
the critical age. The age thus obtained may be 
termed the ‘ effective age,’ and must be used as 
the denominator of the I.Q. ratio for the subject 
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concerned. For subjects below the critical age 
the effective age will evidently be identical with 
the actual age, while for subjects above the 



terminal age it must be taken as equal to the 
effective terminal age. 

The foregoing may perhaps be made clearer 
by an example illustrated by the diagram given 
in Fig. 5. 

The curve ABC in Fig. 5 is the curve of per- 
formance in the Otis test as in Fig. 4. BK 
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is the extension of the straight portion AB of 
the curve. B is at the critical age, C at the ter- 
minal age, and M at the effective terminal age. 

Suppose a child of just 15 years makes a score 
of 145. The point P on BK gives the effective 
mental age as 16 years 9 month^. The point 
Q on the curve ABC corresponds to the actual 
age of 15 years, at which the average score is 
121. QL is parallel to the age axis, and the 
point L on BK gives the effective age as 14 
years 9 months. Thus the true I.Q. — that is, 

the percentage ratio of the effective mental age 

, , , . .16 years g months 

to the effective age-is 14 years 9 monthi 

i.e., — X 100, or 114. 

177 

The procedure for finding the mental age of an 
adult or adolescent is therefore briefly as follows : 

From the subject’s score obtain the effective 
mental age from the straight line ABK. From 
the curve ABC obtain the score corresponding 
to the subject’s actual age, and hence from the 
straight line ABK the effective age correspond- 
ing to this score. ^ The table of norms can, of 

^ Botii theory and practice show that the most probable form of 
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course, be used instead of the curve, provided 
we remember that in finding the effective age 
and the effective mental age the average score 
must be supposed to increase at the same rate 
above the critical age as below that age. The 
true I.Q. will then be given by the formula 


effective mental age 
effective age 


X 100, which is more general 


than the older formula x for the 

latter is applicable only when the mental age and 
the actual age are both below the critical age. 

The foregoing cannot be applied, except in a 
modified form, to such scales as the Stanford 
Revision of the Binet tests. In the latter scale 
Terman takes the effective terminal age as i6 
years, and arbitrarily fixes the value in months 
of the tests in the ‘ average adult ’ and ‘ superior 


the curve of performance in a group test is the curve known as an 
' ogive * If, however, an ogive is much drawn out it approximates 
closely to a straight line over a considerable range This is what 
usually happens, as in the Otis test. If the data permit the plotting 
of the ogive, intercepts for finding efiective mental age would then 
be made on the latter instead of on the simple extension of the 
nearly straight portion , but the difference will be practically 
important only for scores approachmg the maximum possible. It 
should be noticed that the drop m the ordmary curve of performance 
after the cntical age is due to the slackening growth of intelligence, 
and not to the tendency toward the ogive form 
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adult ’ groups in such a way as to cause adults to 
test on the average at a mental age of i6 years. 
The method outlined here could thus be applied 
to the Stanford Revision only after an investi- 
gation had been made to determine the actual 
average performance of subjects from about 14 
to (say) 20 years of age in the upper groups of 
the scale, and a curve of performance plotted 
from the results of this investigation apart from 
any arbitrarily predetermined value in months 
for the harder tests. 

To test satisfactorily the method that we have 
been discussing it would be necessary to retest 
the same children at intervals during adolescence 
in order to discover whether the I Q. calculated 
by this method remained nearly constant for 
the same child and equal to its value below the 
critical age. Unfortunately, such retests with 
the Otis scale are not at present available, but, 
in their absence, artificial cases may be invented 
by the following device : 

A child of given age and given score at that 
age is assumed. Knowing this, the distnbution 
tables at the end of the Otis Manual of Directions 
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will enable the child’s percentile rank (P.R.) to 
be approximately calculated. On the basis of 
this P.R., examination of the distribution in the 
other age groups wall make it possible to esti- 
mate this same child’s probable score at other 
ages, since his P R. remains constant, due 
allow^ance being made for the fact that the higher 
age groups are not fully representative. Thus, 
for example, if his P.R. is 70, to find his probable 
score at 12 years of age look at the distribution 
of scores for^the age group 12, and find the score 
of P.R. 70. 

Two examples may be given in illustration, 
one of a bright and one of a dull child. ^ 

(i) A bright child who at 16 makes a score of 
180. The probable scores of this child for 
various ages were found, and hence his effective 
mental ages, which, with the effective ages, gave 
the values of his I.Q. 

The results are given in the table on p. 69. 

Hence for this case the method gives a nearly 
constant I.Q. Note that by the ordinary 

1 These two examples were suggested to the wiiter (by Mr Burt) 
as a test of his method 
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method this child would at 12 be supposed to 
have a mental age of 18 (130 being the average 
score at 18), giving an I.Q. of 150, very wide 
of the mark. Above 12 years it would not 


Actual 
Age of 
Child 
(Years) 

Probable 

Score 

Effectne 

Mental 

Age 

(Months) 

A nera^t. 

ore at 
Child’s 
Artua) 'Vge 

FfTectue 

A,ge 

(Months) 

True I Q 

CO 

H 

182 

238 

130 

186 

f|-|x 100=128 

16 

180 

236 

125 

181 

Ilf X 100=130 

15 

173 

229 

122 

178 

f||x 100=129 

14 

160 

216 

II2 

! 168 

1 

IJfx 100=129 

12 

130 

180 

1 88 

1 

144 

Ilf X 100=132 

10 

104 

160 

64 

120 

P|§X 100=133 


be possible to assign him a mental age by the 
ordinary method. 

(2) A dull child of I.Q. about 70. This child’s 
mental age when 12 years old would then be ' 
about 8 years 5 months, and therefore his score 
would be 45 (see table of norms). 

The results are given in the table on p. 70. 

Here the variation of I.Q. is too great to be 
satisfactory, but it should be noted that the 
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probable scores obtained for this child by the 
P.R. method for ages above 14 are certainly 
considerably higher than his scores would actu- 
ally be in practice. For the probable scores 


Actual 
Age of 
Child 
(Years) 

Probable 

Score 

Effective 

5 Mental 
Age 

(Months) 

Averitg^e 
Score at 
Child s 
Actual Age 

Effective 

Age 

(Months) 

True I Q 

10 

30 

86 

64 

120 

T®2%XI00= 72 

12 

45 ' 

lOI 

88 

144 

J^xioo= 70 

14 

68 

124 

II 2 

168 

ill X 100 = 74 

15 

00 

H 

137 

122 

178 

HfXioo= 77 

16 

90 

146 

125 

181 

J|-fxioo= 80 

18 

99 

155 

130 

186 

ill X 100= 83 


make his rate of increment of score after 14 
greater than that of the average child, which 
would clearly not be the case Hence in practice 
our method would give an I.Q. constant within 
limits considerably narrower than is apparent 
from the example. It simply happens that the 
P.R. method, which is here an artificial device 
dependent for its accuracy on the accuracy 
and comprehensiveness of the distribution 
70 
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tables, does not, in this case, give probable 
scores sufficiently near to the true scores. 

Now let p stand for the child’s score and q 
for the average score at his age. Then it will be 
seen from the above examples that his effective 
age is q + 56, and his effective mental age 

P + 56, so that his true I.Q. is ^ x 100. 

Similarly, for the Terman group test we should 
have, in consequence of the formula obtained 

in the last chapter, true I.Q. = f ^ ^ ■ x 100. 

Evidently it would be quicker and simpler to 
find the true I.Q.s from formulae like these than 
direct from the curve as in Fig. 5. But the in- 
troduction and discussion of the latter was 
necessary to make the principle of the method 
clear. As a matter of fact, in practice, once the 
norms have been established (and hence the 
critical and terminal ages) for any group test, 
the best thing to do would be to construct, by 
means of a formula like the above, a ready- 
reckoner giving true I.Q. direct from score and 
actual age. 

The results we have arrived at may perhaps 
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be best summarized by a recapitulation of the 
definitions of the new terms used. 

The critical age is the age at which intelligence 
ceases to grow at an approximately constant 
rate, and its development begins to slow down. 

The terminal age is the age at which intelh- 
gence stops growing altogether. 

The terminal score is the average score reached 
at the terminal age. 

The effective terminal age is the age at which 
the terminal score would be reached if intelli- 
gence continued to grow after the critical age 
at the same rate as before the critical age. 

The effective mental age of a subject is the age 
at which the score he makes would be reached 
by him if his intelligence continued to grow 
indefinitely at the same rate as below the 
critical age. When the effective mental age is 
less than the critical age it is identical with the 
actual mental age. 

The effective age of a subject is the age at which 
the average score corresponding to his actual 
age would be reached if inteUigence continued 
to grow at the same rate after the critical age as 
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before that age. If the effective age is less than 
the critical age it will be identical with the actual 
age ; if it is greater than the terminal age it must 
be taken as equal to the effective terminal age. 

The general formula for the I.Q. then be- 
comes the percentage ratio of the effective 
mental age to the effective age. This formula 
gives the true I.Q. for adults and adolescents — 
that is, the I.Q. which characterized them when 
they were below the critical age. 

For the Otis group test the critical age is 
14 years 5 months, the terminal age 18 years, 
and the effective terminal age 15 years 6 months. 
It will be important to determine whether the 
critical, terminal, and effective tenninal ages 
are {a) approximately the same for different 
children, (J) approximately the same for dif- 
ferent test scales. 

11. An Indirect Method. The writer has 
also obtained estimates of true I.Q.s by means 
of a percentile rank method. This method 
rests on two facts : 

(i) Th-t percentage distribution of I.Q.s in any 
particular age group, if sufficiently large, will 
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approximate closely to the percentage dis- 
tribution of I.Q s for the whole population. 
This simply means that any two large age 
groups will probably have about the same 
distribution of I.Q.s. 

(2) Within a given age group the distribution 
of I.Q.s will be the same as that of the mental 
ages, since the denominator of the I.Q. ratio 
will be approximately the same for every 
individual in the group. The narrower the 
limits of age between which the group is taken, 
the nearer will the agreement be. 

In order to use this method two things mast 
be known : 

(a) The general distribution of I.Q.s below 
the critical age (these being true I.Q.s). 

(&) The distribution of scores (and therefore 
of mental ages) in an age group with a narrow 
sge range about the age of the particular indivi- 
dual who is being dealt with. 

The first is now fairly well known, as is also the 
second in the case of some group tests. 

The method, which is simple enough to apply, 
needs only a brief description. Knowing the 
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The method we have been discussing is 
applicable, with slight modifications, to the 
Stanford-Binet individual scale, as explained 
in the appended note. 


NOTE ON A METHOD OF ESTIMATING THE 
TRUE STANFORD-BINET I.Q s OF ADULTS 

The following method of estimating the true 
Bmet (Stanford Revision) I.Q s of adults, or of 
children approaching i6 years of age, which has 
occurred to the writer m the course of his 
general inquiry into methods of estimating the 
true I.Q.s of adults and adolescents, may perhaps 
be of interest to those engaged on Binet tests. 

The difficulty which occurs in this connexion 
with the Bmet scale lies in the fact that, owing 
to lack of extensiveness in the upper ranges, 
I.Q.s of adults, as measured by the scale, are 
bound to drop below their true values. The 
greatest possible I.Q. at which an adult can 
test is about 126. 

For the method to be used two things must 
be known : (i) the distribution of children's 
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Binet I.Q s ; (2) the distribution of the mental 
Siges of adults as measured in the ordmary 
way by the Bmet scale. 

The first is fairly well known (cf. Terman, 
TAe Meastirement of Intelligence (Harrap), pp. 
66 and 78), but there are not yet sufficient data 
to obtain the second accurately, though Terman 
gives a diagram based on 62 ' normal ’ cases 
on p 55 of the book referred to. The writer 
has, however, found the latter useful as a rough 
working basis. {N.B. In Terman’s diagram on 
p. 55, 15. II seems to be a mispnnt for 16. ii ) 

To find true Binet LQ.s proceed thus : Hav- 
ing obtained the mental age of any particular 
adult in the ordinary way, find his P.R. from 
the distribution table for mental ages of adults. 
Since, in finding the I.Q., the actual age is 
taken for every adult as 16 years, the P.R. 
obtained will also be the given person’s P.R. 
as regards I.Q. Then from the distribution 
tables for children’s I.Q s, which are true LQ.s 
(the distribution remaining the same for 
different age groups), it is possible to find 
directly the approximate value of the given 
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adult's true I.Q., his P.R. having been found 
as explained above. 

In conclusion, two points should be noted : 
(i) for I.Q.s below about 105 the value ob- 
tained in the ordinary way is probably very 
nearly the true one ; and (2) Terman's distri- 
bution diagram for adults can be used only up 
to mental ages of about 18 years 6 months. 
When more extensive data are available for 
obtaining this distribution the percentile rank 
method should give I.Q. values of consider- 
able accuracy. 

Table of Provisional Values obtained 
BY P.R. Method 


Mental Age found 
by Usual Method 

True I Q 

16 years 6 months 

107 

i6 „ 

9 .. 

108 

17 

0 „ 

no 

17 » 

3 » 

II 2 

17 - 

6 

II5 

17 .. 

9 .. 

I18 

18 „ 

0 „ 

122 

t8 

3 .. 

126 

18 

6 

132 
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THE RELIABILITY OF THE GROUP INTELLI- 
GENCE TEST AS AN INDEX OF EDUCABILITY 

I N Chapter II we discussed an experiment 
the results of which pointed to the fact 
that the Stanford-Binet scale provided an 
accurate means of gauging the educable capacity 
of a child. This scale is, however, administered 
individually, and thus involves the expenditure 
of a considerable amount of time, especially 
when large groups of children are to be tested. 
In practice, therefore, it is necessary to employ 
group tests which can be applied to a great 
many children simultaneously, retaining the in- 
dividual scale as a standard of reference and a 
source of further information in particular cases. 

The question therefore arises as to the 
reliability of the group test as an index of 
educability. There are two methods of ap- 
proaching the problem, one indirect, the other 
direct. The first method consists in comparing 
the results of the group test with those of the 
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individual test, the reliability of which has 
already been largely established. The second 
method consists in comparing the results of 
the group test with those of ordinary educa- 
tional tests which may be taken as affording 
reasonably accurate estimates of the educable 
capacities of children who have been at school 
for some years. Experiments illustrating both 
methods will now be described. 

I. The Indirect Method The group of 
children tested individually by the S.B. scale, 
as described in Chapter II, was afterward 
given a typical group intelligence test. It 
should be noted that the group tests at present 
in vogue, though they differ from one another 
in detail as regards both particular forms of 
tests employed and particular items of different 
tests, conform largely to a type. If the 
reliability of such a typical group test can be 
estabhshed, it therefore follows that any group 
test of similar type, provided it is reasonably 
comprehensive, and provided it has been shown 
to be sufficiently suitable by trial, is likely to be 
equally|reliable, 
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In the case with which we are dealing the 
%ures obtained from the comparison of the 
results of the individual and the group tests 
were considered for each batch of children 
separately (see Chapter II) as well as for the 
group as a whole. 

The rank correlations for the five batches be- 
tween scores in the group test and mental ages 
obtained by the individual test were as follows : 

Group A Group B Group C Group D Group E 
.86 .86 .91 88 * .92 

When the group was taken as a whole the 
rank correlation worked out at .89. 

It will be seen that these correlations are 
high, and it may therefore be safely concluded 
that the group test is, on the whole, a suf&ciently 
reliable index of educability. 

On examining the results in detail two con- 
clusions were drawn ; 

(1) The group test is a very good method of 

ordering a number of children in respect 
of intelligence, with sufficient accuracy. 

(2) The group test is not so rehable as the 
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individual test for assigning the exact 
mental level of a particular child. Though 
in general the agreement between the 
two tests is close, in particular cases 
the group test may rate a child too low 
or too high. 

The factor which leads to the second con- 
clusion seems to be that the group tests at 
present in use are not so comprehensive as the 
individual test in the appeal they make to 
the intelligence. The mental capacity of each 
child tested in the experiment here described 
was analysed as far as possible under thirteen 
heads, ^ all of which appear to be tested speci- 
fically by the S.B. scale, but some of which are 
not tested specifically by the group scale. In 
the cases of discrepancy scrutiny of the analysis 
revealed, almost invariably, that the child had 

^ These were : (i) Readiness and ability m applying knowledge ; 
(2) Power to discriminate essentials , (3) Richness and logical 
integrity of the associative processes j (4) Ability to control and 
concentrate attention , (5) Power of comprehension ; (6) Abihty 
to hold m mind the conditions of a problem ; (7) Ingenuity and 
practical judgment ; (8) Steadmess of purpose , (9) Power of 
formmg abstract ideas ; (lo) Power of generalization ; (ii) Critical 
ability ; (12) Abihty to manipulate imagery (especially visual 
imagery) * (13) Development of social consciousness, 
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shown up weU (or badly, as the case might be) 
in the individual test in certain directions which 
were not specifically covered by the group test. 
The remedy lies, of course, in making our group 
scales so comprehensive that we may be assured 
of their equal reliability with the individual scale, 
not only in general, but also in particular cases. ‘ 
II. The Direct Method. In Chapter III 
an experiment was described in which some 
500 children were tested by the Terman group 
intelligence scale, and the method of deriving 
the children’s mental ages and I.Q s from their 
scores in the test was explained 

Now these children had also been tested by 
an ordinary written examination in arithmetic 
and English, so that data were available for 
comparing the results of the intelligence test 
with those of the ordmary educational test. 

Before the comparison could be effected it wa 5 
necessary to express the results of the written 
examination in terms similar to those in which 
the results of the intelligence test were expressed. 
This was done by means of a procedure exactly 

^ Cf. footnote at end of Chapter III. 
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analogous to that descnbed in Chapter III. 
That is, the age norms of performance m the 
written examination (taking the arithmetic 
and English together) were established by 
finding the average scores of the children in the 
age groups loj, io|, iij, iif, laj, I2f years. 
A curve of performance was then obtained, and 
by comparing the score of a particular child 
with this curve it was possible to assign to him 
a ‘ mental age ’ indicative not simply of native 
intelligence, but of educational attainment. 
Dividing this quantity by the child’s actual age, 
and expressing the result as a percentage, a figure 
was obtained which might be fairly taken as an 
index of the child’s educability, and was there- 
fore termed the ' educability quotient ’ (E.Q.).^ 
At this point it should be noted that the 
papers in the written examination were cor- 
rected in the ordinary way by a single examiner. 
A subjective factor was thus involved of a 
kind which did not enter into the intelligence 
test, the scoring of which was purely mechanical 
and independent of any particular examiner. 

^ Cf Chapter II, p. 29 
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It was therefore to be ex;pected. that the results 
of the written examination would not exhibit 
the same regularity as those of the intelligence 
test. This was in fact the case ; nevertheless 
the results were sufficiently regular to render the 
fitting of a curve of performance to the data 
a matter of little doubt or difficulty. 



Percentage of Children 


Boys 

Girls 

E Q below IQ.. 

242 

163 

E Q and I Q. corresponding 

557 

561 

E Q above I Q. 

20 1 

27.6 


Now it IS reasonable to suppose that if a child 
is educated in sufficiently favourable circum- 
stances his attainment should keep pace with 
his intelligence, so that, provided our intelli- 
gence tests are reliable, a close correspondence* 
should appear between the I.Q s and the E.Q.s. 

The results of this experiment are given in the 
above table (I Q. and E.Q. are said to correspond 
when they are withm five points of one another). 

These figures speak for themselves, and it is 
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evident that the agreement between the results 
of the two kinds of test is strikingly close in su 
very considerable proportion of cases. Com- 
plete agreement could not in any case be 
expected, for many causes may militate against 
close agreement between the I Q. and E Q. of 
a particular child. Past or present illness, 
absence from school, late entry, bad teaching, 
laziness, will all tend to cause the E.Q. to drop 
below the IQ.; whereas a temperament more 
industrious than the average, good teaching, 
and similar causes may make the E Q. rise above 
the I.Q , it being remembered that these two 
indexes are obtained by comparison with the 
average. In this experiment, of course, the 
average was not that of a sample of the whole 
population, but of this particular group of 
children, which (as explained in Chapter III) 
was somewhat above the average of the whole 
population. In obtaining the I.Q. it was of 
course necessary to use first formula given in 
Chapter III. 

The distributions were given by the following 
figures : 
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EQ. 

No of 
Boys 

No of 
Girls 

Total 

Under 80 

9 

25 

34 

81-90 

28 

70 

98 

91--100 

49 

87 

136 

IOI~IIO 

55 

67 

122 

III~I 20 

19 

30 

49 

121-130 

9 

12 

21 

131 and over 

6 

6 

12 


175 

297 

1 

472 


I.Q 

No of 
Boys 

No of 
Girls 

Total 

Under 80 . 

19 

35 

54 

81-90 

27 

73 

100 

91-100 

41 

85 

126 

lOI-IIO 

40 

57 

97 

111-120 

24 

28 

52 

121-130 

15 

17 

32 

131 and over 

8 

2 

10 


174 

297 

471 


These figures give a median of just imder 
100 for both I.Q. and E.Q., but the semi-inter- 
quartile range shows a slightly greater scatter 
for the former than for the latter, a result 
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similar to that found in the experiment de- 
scribed in Chapter II. It probably points to a 
certain insufficiency in the elasticity of pro- 
motion so far as the brighter children are 
concerned. 

Summing up, it may be said that the results 
thus obtained, both by the indirect and by the 
direct methods, afford good grounds for regard- 
ing tests of the group scale type as useful and 
sufficiently accurate means of estimating educa- 
bility, while . this reliability is likely to be 
increased when our tests are so devised as to 
probe the child’s intelligence from as many 
directions as possible. 
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CHAPTER VI 
CONCLUSION 

W E may now consider, very briefly, the 
conclusions to which we are led by 
the results of the experiments which 
have been described. In the first place, it will 
probably be admitted that these experiments go 
some way toward establishing the reliability 
of intelligence tests in carrying out the special 
purpose for which they were designed — the 
separation and grading of children according 
to degrees of educability Indeed, the general 
correspondence between the children’s per- 
formances in the tests and the levels of educa- 
tional attainment which they had shown them- 
selves capable of reaching was really ver5> 
striking indeed. It is therefore evident that 
intelligence tests should no longer be regarded 
with suspicion as a mere ‘ stunt,’ but should 
be recognized as a valuable instrument capable 
of affording great aid in the advancement of the 
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theory and the practice of educational science. 
The task of the future is to perfect them. 

Secondly, it must not be forgotten that 
intelligence is not the only factor necessary for 
success in school and in after life, though it is 
the most important. Unless he possesses a 
certain minimum of intelligence the child wiU 
not be capable of profiting by advanced educa- 
tion ; but even if he exceeds this minimum 
other factors are necessary to his successful 
progress. Not only must he reach a certain 
standard of physical fitness, but his tempera- 
ment and character must be such that he is 
impelled to use his intelligence effectively. It 
is just here that the real value of the ordinary 
educational tests of attainment becomes ap- 
parent. For we want to know not only whether 
a child is inteUigent, but also whether he is 
capable of making good use of his intelligence. 
Intelligence tests are therefore most valuable 
when their results are scrutinized in comparison 
with the results of educational tests together 
with records of school careers. But if our 
educational tests are to attain their full value 
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an attempt must be made to standardize them 
as effectively as intelligence tests are stan- 
dardized, and also to interpret their results, in 
the case of each child, by an appropriate age 
allowance carried out by some such method 
as that described in Chapter V. Unless dealt 
with in this way educational tests lose much of 
their significance. All cases m which a marked 
discrepancy is found between the results of the 
intelligence and of the attainment tests respec- 
tively should be made the subject of further 
study. 

Finally, a word may be said as to the adminis- 
tration of the tests. While group testing can be 
carried out by any reasonably competent person 
who is willing to take the trouble to foUow 
exactly the prescribed procedure (which is 
becoming more and more simple as the tests are 
improved), it is probable that the administra* 
tion of the individual tests should be left to 
persons having some expert knowledge of the 
subject. In any case detailed interpretation of 
results is a matter for the psychologist. For 
this reason, as well as on account of the time 
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factor, practical testing in the future will in- 
evitably (and quite rightly) be carried out 
mainly by the group method. The individual 
scale will remain as an ultimate standard of 
reference, to which recourse will be had for 
purposes of comparison, and for dealing with 
especially interesting and difficult cases, whether 
supernormal, subnormal, or abnormal. 
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A LIST OF STANDARD 

INTELLIGENCE TESTS 

TERMAN GROUP TEST OF MENTAL ABILITY 

By Dr Lewis M Terman 

For use in the upper standards of elementary schools and in 
secondary schools Test can be completed in 35 minutes 
Examination Form A : 25 Examination Forms, one Key 
Form, one Manual of Directions, 6s. 100 Examination Forms, 

one Key Form, one Manual of Directions, 21s. 

Examination F orm B : Prices, etc , as for Form A 
Specimen Set, containing (i) one copy of Examination 
Forms A and B ; {2) Keys for Forms A and B , (3) Manual 
of Directions 2s, net, in envelope 

OTIS GROUP INTELLIGENCE SCALE 

By Arthur S Otis. 

An absolute point scale to measure in groups the mental ability 
of children throughout the elementary school and upward. 
Primary Examination Forms A and B : Eight tests 
and a page for recording the performance of the examinee 
Adapted for pupils from 6 to 9 years of ^ge 25 copies of 
either Form and one Class Record, 5s. 100 Forms and one 

Class Record, 17s. 6d. 

Advanced Examination Forms A and B : Ten tests 
and a page for recording the performance of the examinee 
Adapted for pupils of 10 years and upward 25 copies of 
either Form and one Class Record, 5s. 100 Forms and one 

Class Record, 17s, 6d. 

Manual of Directions 72 pages Is, 6d. net 
Examiner’s Key for Advanced Examination: 20 pages. 
Is. net 

Specimen Set, containing (i) one copy of each of the 
Primary Forms and one each of the Advanced , (2) Manual 

of Directions , (3) Examiner's Key , (4) Class Record 3s. 6d, 
net, in envelope 

NATIONAL INTELLIGENCE TESTS 

By Professors Haggerty, Terman, Thorndike, Whipple, 
and Yerkes. 

An adaptation for English school purposes of the group intel- 
ligence tests used in the examination of recruits in the U S. Army. 
Scale A, Form 2 : 25 Examination Forms and two Key 
Forms, 6s. 100 Examination Forms and two Key Forms, 21s, 

Scale B, Form 2 ; 25 Examination Forms and one Key 
Form, 6s. xoo Examination Forms and one Key Form, 21s. 
Manual of Directions : Is. net 

Specimen Set, containing (i) Examination Forms A and 
B , (2) Keys for Forms A and B j {3) Manual of Directions, 
2s. 6d, net, in envelope 



THE SIMPLEX GROUP INTELLIGENCE SCALE 

By C A IRichardson, MA, Author of MetJiods and Expert* 
ments tn Mental Tests 

No elaborate directions or timing are required in the use 
of this comprehensive Group Intelligence Scale, which is suit- 
able for all ages above nme years. 

Test Booklet ; 25 copies, with Examiner’s Report, 6s* 

100 copies, with 4 Examiner’s Reports, 21s. 

Manual of Directions : Is. net 

Specimen Set, containing (i) Test Booklet ; (2) Manual 

of Directions, (3) Examiner’s Stencil Key 3 s. net, in 
envelope 

THE NORTHUIV^BERLAND MENTAL TESTS 

By Professor Godfrey H. Thomson, Armstrong College, 
N ewcastle-on-T y ne 
A simple and very effective senes 

Examination Form 1 : 25 copies, 5s. loo copies, 17s. 6d. 
Examination Form II : 25 copies, 5s. 100 copies, 17s. 6d, 
Teacher’s Manual : Is. net 

Specimen Sets I and 11, each cnntrmmg (i) Examina- 
tion Form, with Practice Test , ( 2 ) J ojo .01 -Ma ia.*l Is. 6d. 
net, in envelope 

COURTIS STANDARD PRACTICE TESTS IN 
ARITHMETIC 

By S A Courtis, Supervisor of Educational Reseaich in the 
Detroit Pubhc Schools 

A series of forty-seven graded lessons on cards in two forms — 
A and B — ^mcludmg five research tests and eight cards for 
special study, covering the anthmetical operations wnth whole 
numbers for the elementary standards 

144 Practice Cards, sufficient for twelve pupils , in strong 
envelope, 7s. 6d. Student’s Record and Practice Pad : 
Is. Teacher’s Manual : Is. 6d. net Lesson Cards, 1 to 
48, Form A : 2s. 6d. Lesson Cards, 1 to 48, Form B : 
^ 2s. 6d. 

Specimen Set, contaming (i) one set of Lesson Cards, 
I to 48, Form A , (2) one set of Lesson Cards, i to 48, Form B , 
(3) Student’s Record and Practice Pad , (4) Teacher’s Manual 
5 s. net, m envelope. 

Full paritculars of the above Tests and a hst of New 
and Recent Books for Teacherswtll he sent on request 


GEORGE G. HARRAP 3 COMPANY LTD. 
2 3 3 PORTSMOUTH STREET KINGSWAY W.C. 
CALCUTTA AND SYDNEY 




